Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 422 |
| Missing cells | 13 |
| Missing cells (%) | 0.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 46.3 KiB |
| Average record size in memory | 112.3 B |
Variable types
| Unsupported | 1 |
|---|---|
| Numeric | 12 |
| Categorical | 1 |
Year is highly correlated with Murder and 10 other fields | High correlation |
Murder is highly correlated with Year and 10 other fields | High correlation |
Assault on women is highly correlated with Year and 10 other fields | High correlation |
Kidnapping and Abduction is highly correlated with Year and 10 other fields | High correlation |
Dacoity is highly correlated with Year and 10 other fields | High correlation |
Robbery is highly correlated with Year and 10 other fields | High correlation |
Arson is highly correlated with Year and 10 other fields | High correlation |
Hurt is highly correlated with Year and 10 other fields | High correlation |
Prevention of atrocities (POA) Act is highly correlated with Year and 10 other fields | High correlation |
Protection of Civil Rights (PCR) Act is highly correlated with Year and 10 other fields | High correlation |
Other Crimes Against SCs is highly correlated with Year and 10 other fields | High correlation |
TotalCrime is highly correlated with Year and 10 other fields | High correlation |
Murder is highly correlated with Assault on women and 9 other fields | High correlation |
Assault on women is highly correlated with Murder and 8 other fields | High correlation |
Kidnapping and Abduction is highly correlated with Murder and 8 other fields | High correlation |
Dacoity is highly correlated with Murder and 8 other fields | High correlation |
Robbery is highly correlated with Murder and 8 other fields | High correlation |
Arson is highly correlated with Murder and 8 other fields | High correlation |
Hurt is highly correlated with Murder and 8 other fields | High correlation |
Prevention of atrocities (POA) Act is highly correlated with Murder and 9 other fields | High correlation |
Protection of Civil Rights (PCR) Act is highly correlated with Murder and 3 other fields | High correlation |
Other Crimes Against SCs is highly correlated with Murder and 9 other fields | High correlation |
TotalCrime is highly correlated with Murder and 9 other fields | High correlation |
Murder is highly correlated with Assault on women and 7 other fields | High correlation |
Assault on women is highly correlated with Murder and 7 other fields | High correlation |
Kidnapping and Abduction is highly correlated with Murder and 8 other fields | High correlation |
Dacoity is highly correlated with Kidnapping and Abduction and 2 other fields | High correlation |
Robbery is highly correlated with Murder and 8 other fields | High correlation |
Arson is highly correlated with Murder and 8 other fields | High correlation |
Hurt is highly correlated with Murder and 7 other fields | High correlation |
Prevention of atrocities (POA) Act is highly correlated with Murder and 7 other fields | High correlation |
Other Crimes Against SCs is highly correlated with Murder and 7 other fields | High correlation |
TotalCrime is highly correlated with Murder and 7 other fields | High correlation |
Assault on women is highly correlated with TotalCrime and 10 other fields | High correlation |
TotalCrime is highly correlated with Assault on women and 10 other fields | High correlation |
Other Crimes Against SCs is highly correlated with Assault on women and 10 other fields | High correlation |
Kidnapping and Abduction is highly correlated with Assault on women and 10 other fields | High correlation |
Murder is highly correlated with Assault on women and 10 other fields | High correlation |
Protection of Civil Rights (PCR) Act is highly correlated with Assault on women and 10 other fields | High correlation |
Robbery is highly correlated with Assault on women and 10 other fields | High correlation |
Hurt is highly correlated with Assault on women and 10 other fields | High correlation |
Prevention of atrocities (POA) Act is highly correlated with Assault on women and 10 other fields | High correlation |
Year is highly correlated with Assault on women and 10 other fields | High correlation |
Arson is highly correlated with Assault on women and 10 other fields | High correlation |
Dacoity is highly correlated with Assault on women and 10 other fields | High correlation |
Year is highly skewed (γ1 = 20.51828431) | Skewed |
Assault on women is highly skewed (γ1 = 20.20066131) | Skewed |
Robbery is highly skewed (γ1 = 20.010262) | Skewed |
Arson is highly skewed (γ1 = 20.09444272) | Skewed |
Hurt is highly skewed (γ1 = 20.29124178) | Skewed |
Prevention of atrocities (POA) Act is highly skewed (γ1 = 20.18581884) | Skewed |
Other Crimes Against SCs is highly skewed (γ1 = 20.14604382) | Skewed |
TotalCrime is highly skewed (γ1 = 20.51905362) | Skewed |
STATE/UT is uniformly distributed | Uniform |
df_index is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Murder has 198 (46.9%) zeros | Zeros |
Assault on women has 178 (42.2%) zeros | Zeros |
Kidnapping and Abduction has 215 (50.9%) zeros | Zeros |
Dacoity has 329 (78.0%) zeros | Zeros |
Robbery has 278 (65.9%) zeros | Zeros |
Arson has 252 (59.7%) zeros | Zeros |
Hurt has 185 (43.8%) zeros | Zeros |
Prevention of atrocities (POA) Act has 165 (39.1%) zeros | Zeros |
Protection of Civil Rights (PCR) Act has 281 (66.6%) zeros | Zeros |
Other Crimes Against SCs has 163 (38.6%) zeros | Zeros |
Reproduction
| Analysis started | 2021-09-27 03:29:20.025439 |
|---|---|
| Analysis finished | 2021-09-27 03:29:44.353810 |
| Duration | 24.33 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 13 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4003.467933 |
| Minimum | 2001 |
|---|---|
| Maximum | 842730 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 2001 |
|---|---|
| 5-th percentile | 2001 |
| Q1 | 2004 |
| median | 2007 |
| Q3 | 2010 |
| 95-th percentile | 2012 |
| Maximum | 842730 |
| Range | 840729 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 40974.3564 |
|---|---|
| Coefficient of variation (CV) | 10.23471577 |
| Kurtosis | 420.999994 |
| Mean | 4003.467933 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 20.51828431 |
| Sum | 1685460 |
| Variance | 1678897882 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2010 | 35 | |
| 2002 | 35 | |
| 2008 | 35 | |
| 2005 | 35 | |
| 2004 | 35 | |
| 2001 | 35 | |
| 2003 | 35 | |
| 2011 | 35 | |
| 2006 | 35 | |
| 2012 | 35 | |
| Other values (3) | 71 |
| Value | Count | Frequency (%) |
| 2001 | 35 | |
| 2002 | 35 | |
| 2003 | 35 | |
| 2004 | 35 | |
| 2005 | 35 | |
| 2006 | 35 | |
| 2007 | 35 | |
| 2008 | 35 | |
| 2009 | 35 | |
| 2010 | 35 |
| Value | Count | Frequency (%) |
| 842730 | 1 | 0.2% |
| 2012 | 35 | |
| 2011 | 35 | |
| 2010 | 35 | |
| 2009 | 35 | |
| 2008 | 35 | |
| 2007 | 35 | |
| 2006 | 35 | |
| 2005 | 35 | |
| 2004 | 35 |
| Distinct | 35 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 2 |
| Missing (%) | 0.5% |
| Memory size | 3.4 KiB |
| NAGALAND | 12 |
|---|---|
| HARYANA | 12 |
| MEGHALAYA | 12 |
| MIZORAM | 12 |
| DAMAN & DIU | 12 |
| Other values (30) |
Length
| Max length | 17 |
|---|---|
| Median length | 9 |
| Mean length | 9.485714286 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3984 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ANDHRA PRADESH |
|---|---|
| 2nd row | ANDHRA PRADESH |
| 3rd row | ANDHRA PRADESH |
| 4th row | ANDHRA PRADESH |
| 5th row | ANDHRA PRADESH |
Common Values
| Value | Count | Frequency (%) |
| NAGALAND | 12 | 2.8% |
| HARYANA | 12 | 2.8% |
| MEGHALAYA | 12 | 2.8% |
| MIZORAM | 12 | 2.8% |
| DAMAN & DIU | 12 | 2.8% |
| TRIPURA | 12 | 2.8% |
| LAKSHADWEEP | 12 | 2.8% |
| UTTAR PRADESH | 12 | 2.8% |
| GUJARAT | 12 | 2.8% |
| KERALA | 12 | 2.8% |
| Other values (25) | 300 |
Length
| Value | Count | Frequency (%) |
| pradesh | 60 | 9.6% |
| 48 | 7.7% | |
| n | 24 | 3.8% |
| uttarakhand | 12 | 1.9% |
| rajasthan | 12 | 1.9% |
| assam | 12 | 1.9% |
| kerala | 12 | 1.9% |
| islands | 12 | 1.9% |
| jharkhand | 12 | 1.9% |
| manipur | 12 | 1.9% |
| Other values (34) | 408 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 828 | |
| H | 360 | 9.0% |
| R | 324 | 8.1% |
| D | 240 | 6.0% |
| N | 216 | 5.4% |
| 204 | 5.1% | |
| S | 204 | 5.1% |
| I | 192 | 4.8% |
| E | 168 | 4.2% |
| M | 168 | 4.2% |
| Other values (15) | 1080 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3732 | |
| Space Separator | 204 | 5.1% |
| Other Punctuation | 48 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 828 | |
| H | 360 | |
| R | 324 | 8.7% |
| D | 240 | 6.4% |
| N | 216 | 5.8% |
| S | 204 | 5.5% |
| I | 192 | 5.1% |
| E | 168 | 4.5% |
| M | 168 | 4.5% |
| T | 156 | 4.2% |
| Other values (13) | 876 |
Space Separator
| Value | Count | Frequency (%) |
| 204 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 48 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3732 | |
| Common | 252 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 828 | |
| H | 360 | |
| R | 324 | 8.7% |
| D | 240 | 6.4% |
| N | 216 | 5.8% |
| S | 204 | 5.5% |
| I | 192 | 5.1% |
| E | 168 | 4.5% |
| M | 168 | 4.5% |
| T | 156 | 4.2% |
| Other values (13) | 876 |
Common
| Value | Count | Frequency (%) |
| 204 | ||
| & | 48 | 19.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3984 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 828 | |
| H | 360 | 9.0% |
| R | 324 | 8.1% |
| D | 240 | 6.0% |
| N | 216 | 5.4% |
| 204 | 5.1% | |
| S | 204 | 5.1% |
| I | 192 | 4.8% |
| E | 168 | 4.2% |
| M | 168 | 4.2% |
| Other values (15) | 1080 |
| Distinct | 78 |
|---|---|
| Distinct (%) | 18.5% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.52969121 |
| Minimum | 0 |
|---|---|
| Maximum | 7900 |
| Zeros | 198 |
| Zeros (%) | 46.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 14 |
| 95-th percentile | 83 |
| Maximum | 7900 |
| Range | 7900 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 387.6744058 |
|---|---|
| Coefficient of variation (CV) | 10.32980537 |
| Kurtosis | 405.6090393 |
| Mean | 37.52969121 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 19.96646559 |
| Sum | 15800 |
| Variance | 150291.4449 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 198 | |
| 1 | 22 | 5.2% |
| 3 | 17 | 4.0% |
| 2 | 11 | 2.6% |
| 4 | 9 | 2.1% |
| 5 | 9 | 2.1% |
| 11 | 7 | 1.7% |
| 12 | 7 | 1.7% |
| 7 | 6 | 1.4% |
| 13 | 6 | 1.4% |
| Other values (68) | 129 |
| Value | Count | Frequency (%) |
| 0 | 198 | |
| 1 | 22 | 5.2% |
| 2 | 11 | 2.6% |
| 3 | 17 | 4.0% |
| 4 | 9 | 2.1% |
| 5 | 9 | 2.1% |
| 6 | 5 | 1.2% |
| 7 | 6 | 1.4% |
| 8 | 4 | 0.9% |
| 9 | 6 | 1.4% |
| Value | Count | Frequency (%) |
| 7900 | 1 | |
| 423 | 1 | |
| 371 | 1 | |
| 323 | 1 | |
| 321 | 1 | |
| 318 | 1 | |
| 310 | 1 | |
| 288 | 1 | |
| 286 | 1 | |
| 239 | 1 |
Assault on women
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 113 |
|---|---|
| Distinct (%) | 26.8% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.6152019 |
| Minimum | 0 |
|---|---|
| Maximum | 15917 |
| Zeros | 178 |
| Zeros (%) | 42.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3 |
| Q3 | 35 |
| 95-th percentile | 258 |
| Maximum | 15917 |
| Range | 15917 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 777.9565229 |
|---|---|
| Coefficient of variation (CV) | 10.28836138 |
| Kurtosis | 412.2259878 |
| Mean | 75.6152019 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 20.20066131 |
| Sum | 31834 |
| Variance | 605216.3516 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 178 | |
| 1 | 17 | 4.0% |
| 2 | 9 | 2.1% |
| 3 | 8 | 1.9% |
| 11 | 8 | 1.9% |
| 6 | 7 | 1.7% |
| 9 | 7 | 1.7% |
| 5 | 6 | 1.4% |
| 7 | 5 | 1.2% |
| 8 | 5 | 1.2% |
| Other values (103) | 171 |
| Value | Count | Frequency (%) |
| 0 | 178 | |
| 1 | 17 | 4.0% |
| 2 | 9 | 2.1% |
| 3 | 8 | 1.9% |
| 4 | 4 | 0.9% |
| 5 | 6 | 1.4% |
| 6 | 7 | 1.7% |
| 7 | 5 | 1.2% |
| 8 | 5 | 1.2% |
| 9 | 7 | 1.7% |
| Value | Count | Frequency (%) |
| 15917 | 1 | |
| 412 | 2 | |
| 397 | 1 | |
| 375 | 1 | |
| 367 | 1 | |
| 357 | 1 | |
| 349 | 1 | |
| 343 | 1 | |
| 340 | 1 | |
| 335 | 2 |
Kidnapping and Abduction
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 56 |
|---|---|
| Distinct (%) | 13.3% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.22327791 |
| Minimum | 0 |
|---|---|
| Maximum | 4678 |
| Zeros | 215 |
| Zeros (%) | 50.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 8 |
| 95-th percentile | 36 |
| Maximum | 4678 |
| Range | 4678 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 230.1116138 |
|---|---|
| Coefficient of variation (CV) | 10.35453072 |
| Kurtosis | 401.7343008 |
| Mean | 22.22327791 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.83284676 |
| Sum | 9356 |
| Variance | 52951.35479 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 215 | |
| 1 | 31 | 7.3% |
| 2 | 24 | 5.7% |
| 3 | 12 | 2.8% |
| 4 | 11 | 2.6% |
| 5 | 8 | 1.9% |
| 18 | 8 | 1.9% |
| 8 | 7 | 1.7% |
| 6 | 7 | 1.7% |
| 27 | 6 | 1.4% |
| Other values (46) | 92 |
| Value | Count | Frequency (%) |
| 0 | 215 | |
| 1 | 31 | 7.3% |
| 2 | 24 | 5.7% |
| 3 | 12 | 2.8% |
| 4 | 11 | 2.6% |
| 5 | 8 | 1.9% |
| 6 | 7 | 1.7% |
| 7 | 6 | 1.4% |
| 8 | 7 | 1.7% |
| 9 | 3 | 0.7% |
| Value | Count | Frequency (%) |
| 4678 | 1 | |
| 363 | 1 | |
| 258 | 1 | |
| 254 | 1 | |
| 248 | 1 | |
| 219 | 2 | |
| 153 | 1 | |
| 130 | 1 | |
| 113 | 1 | |
| 99 | 1 |
| Distinct | 16 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.876484561 |
| Minimum | 0 |
|---|---|
| Maximum | 395 |
| Zeros | 329 |
| Zeros (%) | 78.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 6 |
| Maximum | 395 |
| Range | 395 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 19.39425401 |
|---|---|
| Coefficient of variation (CV) | 10.33541891 |
| Kurtosis | 404.722246 |
| Mean | 1.876484561 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.93360003 |
| Sum | 790 |
| Variance | 376.1370886 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 329 | |
| 1 | 26 | 6.2% |
| 3 | 15 | 3.6% |
| 2 | 12 | 2.8% |
| 4 | 9 | 2.1% |
| 5 | 7 | 1.7% |
| 7 | 7 | 1.7% |
| 8 | 3 | 0.7% |
| 16 | 3 | 0.7% |
| 6 | 3 | 0.7% |
| Other values (6) | 7 | 1.7% |
| Value | Count | Frequency (%) |
| 0 | 329 | |
| 1 | 26 | 6.2% |
| 2 | 12 | 2.8% |
| 3 | 15 | 3.6% |
| 4 | 9 | 2.1% |
| 5 | 7 | 1.7% |
| 6 | 3 | 0.7% |
| 7 | 7 | 1.7% |
| 8 | 3 | 0.7% |
| 9 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 395 | 1 | 0.2% |
| 22 | 1 | 0.2% |
| 20 | 1 | 0.2% |
| 17 | 1 | 0.2% |
| 16 | 3 | |
| 11 | 2 | 0.5% |
| 9 | 1 | 0.2% |
| 8 | 3 | |
| 7 | 7 | |
| 6 | 3 |
Robbery
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 25 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.527315914 |
| Minimum | 0 |
|---|---|
| Maximum | 953 |
| Zeros | 278 |
| Zeros (%) | 65.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 13 |
| Maximum | 953 |
| Range | 953 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 46.73468519 |
|---|---|
| Coefficient of variation (CV) | 10.32282396 |
| Kurtosis | 406.7279493 |
| Mean | 4.527315914 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.010262 |
| Sum | 1906 |
| Variance | 2184.1308 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 278 | |
| 1 | 40 | 9.5% |
| 3 | 17 | 4.0% |
| 4 | 11 | 2.6% |
| 2 | 10 | 2.4% |
| 6 | 9 | 2.1% |
| 5 | 7 | 1.7% |
| 11 | 5 | 1.2% |
| 10 | 5 | 1.2% |
| 8 | 5 | 1.2% |
| Other values (15) | 34 | 8.1% |
| Value | Count | Frequency (%) |
| 0 | 278 | |
| 1 | 40 | 9.5% |
| 2 | 10 | 2.4% |
| 3 | 17 | 4.0% |
| 4 | 11 | 2.6% |
| 5 | 7 | 1.7% |
| 6 | 9 | 2.1% |
| 7 | 5 | 1.2% |
| 8 | 5 | 1.2% |
| 9 | 4 | 0.9% |
| Value | Count | Frequency (%) |
| 953 | 1 | 0.2% |
| 83 | 1 | 0.2% |
| 37 | 1 | 0.2% |
| 24 | 2 | |
| 22 | 1 | 0.2% |
| 20 | 2 | |
| 19 | 3 | |
| 17 | 4 | |
| 16 | 1 | 0.2% |
| 15 | 2 |
Arson
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 50 |
|---|---|
| Distinct (%) | 11.9% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.90736342 |
| Minimum | 0 |
|---|---|
| Maximum | 2717 |
| Zeros | 252 |
| Zeros (%) | 59.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 5 |
| 95-th percentile | 39 |
| Maximum | 2717 |
| Range | 2717 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 133.0391581 |
|---|---|
| Coefficient of variation (CV) | 10.30722958 |
| Kurtosis | 409.1996997 |
| Mean | 12.90736342 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 20.09444272 |
| Sum | 5434 |
| Variance | 17699.41759 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 252 | |
| 1 | 25 | 5.9% |
| 2 | 17 | 4.0% |
| 4 | 10 | 2.4% |
| 5 | 10 | 2.4% |
| 3 | 8 | 1.9% |
| 7 | 7 | 1.7% |
| 12 | 6 | 1.4% |
| 10 | 6 | 1.4% |
| 13 | 5 | 1.2% |
| Other values (40) | 75 | 17.8% |
| Value | Count | Frequency (%) |
| 0 | 252 | |
| 1 | 25 | 5.9% |
| 2 | 17 | 4.0% |
| 3 | 8 | 1.9% |
| 4 | 10 | 2.4% |
| 5 | 10 | 2.4% |
| 6 | 3 | 0.7% |
| 7 | 7 | 1.7% |
| 8 | 5 | 1.2% |
| 9 | 4 | 0.9% |
| Value | Count | Frequency (%) |
| 2717 | 1 | |
| 178 | 1 | |
| 103 | 1 | |
| 76 | 1 | |
| 66 | 1 | |
| 62 | 1 | |
| 61 | 1 | |
| 57 | 1 | |
| 53 | 2 | |
| 51 | 1 |
Hurt
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 172 |
|---|---|
| Distinct (%) | 40.9% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 233.5106888 |
| Minimum | 0 |
|---|---|
| Maximum | 49154 |
| Zeros | 185 |
| Zeros (%) | 43.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4 |
| Q3 | 149 |
| 95-th percentile | 626 |
| Maximum | 49154 |
| Range | 49154 |
| Interquartile range (IQR) | 149 |
Descriptive statistics
| Standard deviation | 2398.808699 |
|---|---|
| Coefficient of variation (CV) | 10.27280041 |
| Kurtosis | 414.7471934 |
| Mean | 233.5106888 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 20.29124178 |
| Sum | 98308 |
| Variance | 5754283.174 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 185 | |
| 1 | 13 | 3.1% |
| 4 | 7 | 1.7% |
| 2 | 7 | 1.7% |
| 3 | 5 | 1.2% |
| 48 | 5 | 1.2% |
| 9 | 4 | 0.9% |
| 7 | 4 | 0.9% |
| 23 | 3 | 0.7% |
| 12 | 3 | 0.7% |
| Other values (162) | 185 |
| Value | Count | Frequency (%) |
| 0 | 185 | |
| 1 | 13 | 3.1% |
| 2 | 7 | 1.7% |
| 3 | 5 | 1.2% |
| 4 | 7 | 1.7% |
| 5 | 2 | 0.5% |
| 6 | 3 | 0.7% |
| 7 | 4 | 0.9% |
| 8 | 2 | 0.5% |
| 9 | 4 | 0.9% |
| Value | Count | Frequency (%) |
| 49154 | 1 | |
| 1252 | 1 | |
| 950 | 1 | |
| 900 | 1 | |
| 890 | 1 | |
| 877 | 1 | |
| 858 | 1 | |
| 821 | 1 | |
| 815 | 1 | |
| 722 | 1 |
Prevention of atrocities (POA) Act
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 191 |
|---|---|
| Distinct (%) | 45.4% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 591.7244656 |
| Minimum | 0 |
|---|---|
| Maximum | 124558 |
| Zeros | 165 |
| Zeros (%) | 39.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 8 |
| Q3 | 267 |
| 95-th percentile | 1514 |
| Maximum | 124558 |
| Range | 124558 |
| Interquartile range (IQR) | 267 |
Descriptive statistics
| Standard deviation | 6089.419785 |
|---|---|
| Coefficient of variation (CV) | 10.29097179 |
| Kurtosis | 411.8054618 |
| Mean | 591.7244656 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 20.18581884 |
| Sum | 249116 |
| Variance | 37081033.31 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 165 | |
| 1 | 26 | 6.2% |
| 2 | 10 | 2.4% |
| 4 | 4 | 0.9% |
| 36 | 4 | 0.9% |
| 24 | 3 | 0.7% |
| 53 | 3 | 0.7% |
| 150 | 2 | 0.5% |
| 21 | 2 | 0.5% |
| 32 | 2 | 0.5% |
| Other values (181) | 200 |
| Value | Count | Frequency (%) |
| 0 | 165 | |
| 1 | 26 | 6.2% |
| 2 | 10 | 2.4% |
| 3 | 2 | 0.5% |
| 4 | 4 | 0.9% |
| 5 | 2 | 0.5% |
| 8 | 2 | 0.5% |
| 10 | 1 | 0.2% |
| 12 | 1 | 0.2% |
| 13 | 2 | 0.5% |
| Value | Count | Frequency (%) |
| 124558 | 1 | |
| 4885 | 1 | |
| 4436 | 1 | |
| 3072 | 1 | |
| 3024 | 1 | |
| 2974 | 1 | |
| 2965 | 1 | |
| 2554 | 1 | |
| 2548 | 1 | |
| 2534 | 1 |
Protection of Civil Rights (PCR) Act
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 57 |
|---|---|
| Distinct (%) | 13.5% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.28503563 |
| Minimum | 0 |
|---|---|
| Maximum | 4270 |
| Zeros | 281 |
| Zeros (%) | 66.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 68 |
| Maximum | 4270 |
| Range | 4270 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 210.6970619 |
|---|---|
| Coefficient of variation (CV) | 10.38682237 |
| Kurtosis | 396.759567 |
| Mean | 20.28503563 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 19.66194537 |
| Sum | 8540 |
| Variance | 44393.25189 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 281 | |
| 1 | 30 | 7.1% |
| 2 | 12 | 2.8% |
| 3 | 11 | 2.6% |
| 12 | 5 | 1.2% |
| 10 | 4 | 0.9% |
| 5 | 4 | 0.9% |
| 20 | 4 | 0.9% |
| 4 | 3 | 0.7% |
| 26 | 3 | 0.7% |
| Other values (47) | 64 | 15.2% |
| Value | Count | Frequency (%) |
| 0 | 281 | |
| 1 | 30 | 7.1% |
| 2 | 12 | 2.8% |
| 3 | 11 | 2.6% |
| 4 | 3 | 0.7% |
| 5 | 4 | 0.9% |
| 6 | 3 | 0.7% |
| 7 | 1 | 0.2% |
| 8 | 3 | 0.7% |
| 9 | 2 | 0.5% |
| Value | Count | Frequency (%) |
| 4270 | 1 | |
| 459 | 1 | |
| 312 | 1 | |
| 198 | 1 | |
| 165 | 1 | |
| 133 | 1 | |
| 123 | 1 | |
| 122 | 2 | |
| 113 | 1 | |
| 101 | 1 |
Other Crimes Against SCs
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 186 |
|---|---|
| Distinct (%) | 44.2% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 758.631829 |
| Minimum | 0 |
|---|---|
| Maximum | 159692 |
| Zeros | 163 |
| Zeros (%) | 38.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 6 |
| Q3 | 283 |
| 95-th percentile | 2645 |
| Maximum | 159692 |
| Range | 159692 |
| Interquartile range (IQR) | 283 |
Descriptive statistics
| Standard deviation | 7812.209512 |
|---|---|
| Coefficient of variation (CV) | 10.29776133 |
| Kurtosis | 410.7122497 |
| Mean | 758.631829 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 20.14604382 |
| Sum | 319384 |
| Variance | 61030617.45 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 163 | |
| 1 | 22 | 5.2% |
| 2 | 10 | 2.4% |
| 3 | 6 | 1.4% |
| 4 | 5 | 1.2% |
| 6 | 4 | 0.9% |
| 31 | 3 | 0.7% |
| 22 | 3 | 0.7% |
| 13 | 3 | 0.7% |
| 127 | 3 | 0.7% |
| Other values (176) | 199 |
| Value | Count | Frequency (%) |
| 0 | 163 | |
| 1 | 22 | 5.2% |
| 2 | 10 | 2.4% |
| 3 | 6 | 1.4% |
| 4 | 5 | 1.2% |
| 5 | 2 | 0.5% |
| 6 | 4 | 0.9% |
| 8 | 2 | 0.5% |
| 9 | 2 | 0.5% |
| 10 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 159692 | 1 | |
| 4771 | 1 | |
| 4536 | 1 | |
| 4296 | 1 | |
| 4239 | 1 | |
| 4014 | 1 | |
| 3974 | 1 | |
| 3795 | 1 | |
| 3683 | 1 | |
| 3676 | 1 |
TotalCrime
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWED| Distinct | 238 |
|---|---|
| Distinct (%) | 56.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22994.5782 |
| Minimum | 0 |
|---|---|
| Maximum | 4851856 |
| Zeros | 1 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 8012 |
| Q1 | 8036 |
| median | 8212 |
| Q3 | 12604 |
| 95-th percentile | 25600.4 |
| Maximum | 4851856 |
| Range | 4851856 |
| Interquartile range (IQR) | 4568 |
Descriptive statistics
| Standard deviation | 235713.6572 |
|---|---|
| Coefficient of variation (CV) | 10.25083631 |
| Kurtosis | 421.351172 |
| Mean | 22994.5782 |
| Median Absolute Deviation (MAD) | 202 |
| Skewness | 20.51905362 |
| Sum | 9703712 |
| Variance | 5.556092819 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8028 | 18 | 4.3% |
| 8032 | 18 | 4.3% |
| 8044 | 16 | 3.8% |
| 8048 | 15 | 3.6% |
| 8040 | 15 | 3.6% |
| 8016 | 14 | 3.3% |
| 8012 | 12 | 2.8% |
| 8008 | 12 | 2.8% |
| 8036 | 10 | 2.4% |
| 8020 | 9 | 2.1% |
| Other values (228) | 283 |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.2% |
| 8004 | 7 | 1.7% |
| 8008 | 12 | |
| 8012 | 12 | |
| 8016 | 14 | |
| 8020 | 9 | |
| 8024 | 8 | |
| 8028 | 18 | |
| 8032 | 18 | |
| 8036 | 10 |
| Value | Count | Frequency (%) |
| 4851856 | 1 | |
| 50932 | 1 | |
| 40068 | 1 | |
| 39716 | 1 | |
| 38852 | 1 | |
| 38124 | 1 | |
| 36876 | 1 | |
| 33128 | 1 | |
| 32856 | 1 | |
| 32604 | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | Year | STATE/UT | Murder | Assault on women | Kidnapping and Abduction | Dacoity | Robbery | Arson | Hurt | Prevention of atrocities (POA) Act | Protection of Civil Rights (PCR) Act | Other Crimes Against SCs | TotalCrime | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2001.0 | ANDHRA PRADESH | 45.0 | 69.0 | 22.0 | 3.0 | 2.0 | 6.0 | 518.0 | 950.0 | 312.0 | 1006.0 | 19736.0 |
| 1 | 1 | 2002.0 | ANDHRA PRADESH | 60.0 | 98.0 | 18.0 | 0.0 | 4.0 | 12.0 | 568.0 | 830.0 | 459.0 | 1336.0 | 21548.0 |
| 2 | 2 | 2003.0 | ANDHRA PRADESH | 33.0 | 79.0 | 27.0 | 1.0 | 15.0 | 4.0 | 615.0 | 1234.0 | 165.0 | 1386.0 | 22248.0 |
| 3 | 3 | 2004.0 | ANDHRA PRADESH | 39.0 | 66.0 | 28.0 | 0.0 | 7.0 | 20.0 | 474.0 | 1319.0 | 68.0 | 1234.0 | 21036.0 |
| 4 | 4 | 2005.0 | ANDHRA PRADESH | 37.0 | 74.0 | 21.0 | 0.0 | 0.0 | 9.0 | 459.0 | 1244.0 | 61.0 | 1212.0 | 20488.0 |
| 5 | 5 | 2006.0 | ANDHRA PRADESH | 52.0 | 97.0 | 12.0 | 3.0 | 5.0 | 13.0 | 657.0 | 1514.0 | 93.0 | 1445.0 | 23588.0 |
| 6 | 6 | 2007.0 | ANDHRA PRADESH | 46.0 | 105.0 | 25.0 | 0.0 | 0.0 | 17.0 | 541.0 | 1200.0 | 122.0 | 1327.0 | 21560.0 |
| 7 | 7 | 2008.0 | ANDHRA PRADESH | 48.0 | 88.0 | 18.0 | 0.0 | 0.0 | 5.0 | 651.0 | 1383.0 | 123.0 | 1682.0 | 24024.0 |
| 8 | 8 | 2009.0 | ANDHRA PRADESH | 35.0 | 99.0 | 19.0 | 1.0 | 4.0 | 12.0 | 722.0 | 1737.0 | 39.0 | 1836.0 | 26052.0 |
| 9 | 9 | 2010.0 | ANDHRA PRADESH | 43.0 | 100.0 | 18.0 | 0.0 | 1.0 | 17.0 | 709.0 | 1509.0 | 50.0 | 1874.0 | 25324.0 |
Last rows
| df_index | Year | STATE/UT | Murder | Assault on women | Kidnapping and Abduction | Dacoity | Robbery | Arson | Hurt | Prevention of atrocities (POA) Act | Protection of Civil Rights (PCR) Act | Other Crimes Against SCs | TotalCrime | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 412 | 412 | 2005.0 | PUDUCHERRY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 12.0 | 0.0 | 8076.0 |
| 413 | 413 | 2006.0 | PUDUCHERRY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 14.0 | 0.0 | 8080.0 |
| 414 | 414 | 2007.0 | PUDUCHERRY | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 24.0 | 0.0 | 8128.0 |
| 415 | 415 | 2008.0 | PUDUCHERRY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 27.0 | 0.0 | 8148.0 |
| 416 | 416 | 2009.0 | PUDUCHERRY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 3.0 | 26.0 | 0.0 | 8152.0 |
| 417 | 417 | 2010.0 | PUDUCHERRY | 1.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 1.0 | 2.0 | 26.0 | 0.0 | 8164.0 |
| 418 | 418 | 2011.0 | PUDUCHERRY | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 15.0 | 2.0 | 8116.0 |
| 419 | 419 | 2012.0 | PUDUCHERRY | 2.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 1.0 | 20.0 | 1.0 | 8144.0 |
| 420 | TotalCrime | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.0 |
| 421 | Column_Total | 842730.0 | NaN | 7900.0 | 15917.0 | 4678.0 | 395.0 | 953.0 | 2717.0 | 49154.0 | 124558.0 | 4270.0 | 159692.0 | 4851856.0 |